Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix deconv 3D post OPs segment fault issue. #130

Conversation

luweizhou2016
Copy link
Collaborator

@luweizhou2016 luweizhou2016 commented Jun 9, 2022

Fix missing update post-ops argument offset to RSP when pushing.

OpenVINO pr:
openvinotoolkit/openvino#11836

@luo-cheng2021 luo-cheng2021 merged commit 60881ec into openvinotoolkit:v2.6_for_ie_master Jun 9, 2022
luweizhou2016 added a commit that referenced this pull request Jun 15, 2023
[FEATURE] Introduced Depthwise and Quantization post ops
luweizhou2016 pushed a commit that referenced this pull request Jun 15, 2023
…ta pointers

* Fix the deconv fused with depthwise issue in cpuFuncTests

* Switch to use jit_uni_depthwise_injector API.

* Fix potential conflicts in registers and YMM.

* Update with optimization.

* ONEDNN 3.2 migration squashed commits:

- Preserve bf16emu scratch register when conflict with legacy post ops
- fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
- gemm convolution: fixed bias offset
- Fix onednn gemm JIT/gemm  kernel scale.
- Luwei/fix deconv 3d postops bug (#136)
- Fix deconv 3D post OPs segment fault issue. (#130)
- Fix incorrect offset to rsp
- fix depthwise nwc conv
- fix legacyOps with stock src_zero_point in jit_avx512_core_amx
- fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
luweizhou2016 pushed a commit that referenced this pull request Jun 15, 2023
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.
luweizhou2016 pushed a commit that referenced this pull request Jun 29, 2023
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.
luweizhou2016 pushed a commit that referenced this pull request Dec 20, 2023
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.
luweizhou2016 pushed a commit that referenced this pull request Jan 10, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.
luweizhou2016 pushed a commit that referenced this pull request Jun 14, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.
luweizhou2016 pushed a commit that referenced this pull request Jul 18, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops
luweizhou2016 pushed a commit that referenced this pull request Jul 19, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops
luweizhou2016 added a commit that referenced this pull request Jul 24, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops
luweizhou2016 pushed a commit that referenced this pull request Jul 24, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops
luweizhou2016 pushed a commit that referenced this pull request Jul 24, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops
luweizhou2016 pushed a commit that referenced this pull request Jul 24, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers
luweizhou2016 pushed a commit that referenced this pull request Jul 24, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.
luweizhou2016 pushed a commit that referenced this pull request Jul 24, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.
luweizhou2016 pushed a commit that referenced this pull request Jul 24, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict
luweizhou2016 pushed a commit that referenced this pull request Jul 24, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict
luweizhou2016 pushed a commit that referenced this pull request Jul 25, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables
luweizhou2016 pushed a commit that referenced this pull request Jul 25, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables

[FIX] fix avx512 bf16 dw stack pointer
luweizhou2016 pushed a commit that referenced this pull request Jul 25, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables

[FIX] fix avx512 bf16 dw stack pointer

[ARM] Fixed legacy post-ops changes for ARM target
luweizhou2016 added a commit that referenced this pull request Jul 26, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables

[FIX] fix avx512 bf16 dw stack pointer

[ARM] Fixed legacy post-ops changes for ARM target
luweizhou2016 added a commit that referenced this pull request Jul 29, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables

[FIX] fix avx512 bf16 dw stack pointer

[ARM] Fixed legacy post-ops changes for ARM target
luweizhou2016 added a commit that referenced this pull request Jul 29, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables

[FIX] fix avx512 bf16 dw stack pointer

[ARM] Fixed legacy post-ops changes for ARM target
azhai219 pushed a commit that referenced this pull request Dec 4, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (#128)
    - Luwei/fix deconv 3d postops bug (#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables

[FIX] fix avx512 bf16 dw stack pointer

[ARM] Fixed legacy post-ops changes for ARM target
azhai219 pushed a commit to azhai219/oneDNN that referenced this pull request Dec 6, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (openvinotoolkit#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (openvinotoolkit#128)
    - Luwei/fix deconv 3d postops bug (openvinotoolkit#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables

[FIX] fix avx512 bf16 dw stack pointer

[ARM] Fixed legacy post-ops changes for ARM target
azhai219 pushed a commit to azhai219/oneDNN that referenced this pull request Dec 9, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (openvinotoolkit#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (openvinotoolkit#128)
    - Luwei/fix deconv 3d postops bug (openvinotoolkit#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables

[FIX] fix avx512 bf16 dw stack pointer

[ARM] Fixed legacy post-ops changes for ARM target
azhai219 pushed a commit to azhai219/oneDNN that referenced this pull request Dec 9, 2024
…ta pointers

ONEDNN 3.2 migration squashed commits:
    - fix depthwise nwc conv
    - Fix deconv 3D post OPs segment fault issue. (openvinotoolkit#130)
    - fix in avx2 conv+fakequant post ops +nxc last channel wrong result (openvinotoolkit#128)
    - Luwei/fix deconv 3d postops bug (openvinotoolkit#136)
        -- Fix the deconv fused with depthwise issue in cpuFuncTests
        -- Switch to use jit_uni_depthwise_injector API.
        -- Fix potential conflicts in registers and YMM.
        -- Update with optimization.
    - fix legacyOps with stock src_zero_point in jit_avx512_core_amx
    - Fix incorrect offset to rsp
    - Preserve bf16emu scratch register when conflict with legacy post ops
    - fix per-OC legacyPostOps for jit_avx512_dw_conv_fwd_kernel_bf16
    - Fix segment fault caused by dest scale.

ONEDNN 3.5 migration squshed commmits:
[FIX][FORK][FEATURE] Introduced Depthwise and Quantization post ops

[FORK] [FIX] Fix legacy zero point issue on AVX2 + AVX512.
[FEATURE]Migrate legacy post ops and zero points on runtime data pointers

[FORK][FIX] jit_uni_dw_conv_row_f32: fixed post ops start idx
[FEATURE] Migrate legacy post ops and zero points on runtime data pointers

[Fix] Update the attr checking caused by forked onednn.

[FORK][FIX][x64] Add proper post op checks to gemm_conv is split
particially and squashed.

[FORK][FIX] jit_uni_dw_conv_kernel_f32: fixed register conflict

[FIX] SPlit the removed unused variables

[FIX] fix avx512 bf16 dw stack pointer

[ARM] Fixed legacy post-ops changes for ARM target
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants